Large-scale microbiome data integration enables robust biomarker identification

نویسندگان

چکیده

Abstract The close association between gut microbiota dysbiosis and human diseases is being increasingly recognized. However, contradictory results are frequently reported, as confounding effects exist. lack of unbiased data integration methods also impeding the discovery disease-associated microbial biomarkers from different cohorts. Here we propose an algorithm, NetMoss, for assessing shifts network modules to identify robust associated with various diseases. Compared previous approaches, NetMoss method shows better performance in removing batch effects. Through comprehensive evaluations on both simulated real datasets, demonstrate that has great advantages identification disease-related biomarkers. Based analysis pandisease studies, there a high prevalence multidisease-related bacteria global populations. We believe large-scale will help understanding role microbiome more perspective accurate biomarker greatly promote microbiome-based medical diagnosis.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Classification of Protein Variation Using Structural Modeling and Large-Scale Data Integration

Existing methods for interpreting protein variation focus on annotating mutation pathogenicity rather than detailed interpretation of variant deleteriousness and frequently use only sequence-based or structure-based information. We present VIPUR, a computational framework that seamlessly integrates sequence analysis and structural modeling (using the Rosetta protein modeling suite) to identify ...

متن کامل

Robust classification of protein variation using structural modelling and large-scale data integration

Existing methods for interpreting protein variation focus on annotating mutation pathogenicity rather than detailed interpretation of variant deleteriousness and frequently use only sequence-based or structure-based information. We present VIPUR, a computational framework that seamlessly integrates sequence analysis and structural modelling (using the Rosetta protein modelling suite) to identif...

متن کامل

Integrated Robust Identification and Control of Large-Scale Processes

We propose the use of pseudo-singular values, which are closely related to singular values but are allowed to have sign, as a convenient approach for developing techniques for the identification and control of large-scale processes. Steady-state controllability can be assessed directly in terms of the pseudo-singular values. It is shown that to control an output disturbance direction with zero ...

متن کامل

Microfluidic large-scale integration.

We developed high-density microfluidic chips that contain plumbing networks with thousands of micromechanical valves and hundreds of individually addressable chambers. These fluidic devices are analogous to electronic integrated circuits fabricated using large-scale integration. A key component of these networks is the fluidic multiplexor, which is a combinatorial array of binary valve patterns...

متن کامل

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Nature Computational Science

سال: 2022

ISSN: ['2662-8457']

DOI: https://doi.org/10.1038/s43588-022-00247-8